Tiling for Heterogeneous Computing Platforms
نویسندگان
چکیده
In the framework of fully permutable loops, tiling has been extensively studied as a sourceto-source program transformation. However, little work has been devoted to the mapping and scheduling of the tiles on physical processors. Moreover, targeting heterogeneous computing platforms has, to the best of our knowledge, never been considered. In this paper we extend tiling techniques to the context of limited computational resources with different-speed processors. In particular, we present efficient scheduling and mapping strategies that are asymptotically optimal. The practical usefulness of these strategies is fully demonstrated by MPI experiments on a heterogeneous network of workstations.
منابع مشابه
Tiling for Heterogeneous Computing Platforms Ecole Normale Supérieure De Lyon Tiling for Heterogeneous Computing Platforms Tiling for Heterogeneous Computing Platforms
In the framework of fully permutable loops tiling has been extensively studied as a source to source program transformation However little work has been devoted to the mapping and scheduling of the tiles on physical processors Moreover targeting hetero geneous computing platforms has to the best of our knowledge never been considered In this paper we extend tiling techniques to the context of l...
متن کاملStatic tiling for heterogeneous computing platforms
In the framework of fully permutable loops, tiling has been extensively studied as a sourceto-source program transformation. However, little work has been devoted to the mapping and scheduling of the tiles on physical processors. Moreover, targeting heterogeneous computing platforms has to the best of our knowledge, never been considered. In this paper we extend static tiling techniques to the ...
متن کاملAnalyzing power efficiency of optimization techniques and algorithm design methods for applications on heterogeneous platforms
Graphics processing units (GPUs) have become widely accepted as the computing platform of choice in many high performance computing domains. The availability of programming standards such as OpenCL are used to leverage the inherent parallelism offered by GPUs. Source code optimizations such as loop unrolling and tiling when targeted to heterogeneous applications have reported large gains in per...
متن کاملTiling and Scheduling of Three-level Perfectly Nested Loops with Dependencies on Heterogeneous Systems
Nested loops are one of the most time-consuming parts and the largest sources of parallelism in many scientific applications. In this paper, we address the problem of 3-dimensional tiling and scheduling of three-level perfectly nested loops with dependencies on heterogeneous systems. To exploit the parallelism, we tile and schedule nested loops with dependencies by awareness of computational po...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 1997